^ . PCT/EPOO/ 05 26 t, . 

BUNDE5REPUBLI K DE UTSCHUyj[^^ 

PRIORITY 
DOCUMENT 

SUBMITTED OR TRANSMITTED IN 
COMPLIANCE WITH RULE 17.1(a) OR (b) 

ee oo| &2^1Bescheinigung 

Die BASF Aktiengesellschaft in Ludwigshafen/Deutscliland hat eine Patent- 
anmeldung unter der Bezeichnung 





"New calpains and use thereof (Neue Calpaine und 
deren VenA/endung)" 



am 18. Juni 1999 beim Deutschen Patent- und Markenamt eingereicht. 

Die angehefteten StCicke sind eine richtige und genaue Wiedergabe der ursprung- 
lichen Unterlagen dieser Patentanmeldung. 

Die Anmeldung hat im Deutschen Patent- und IVIarkenamt vorlaufig die Symbole 
C 07 K, C 07 H und C 12 Q der Internationalen Patentklassifikation erhalten. 



Munchen, den 30. Mai 2000 




BEST AVAILABLE COPY 



BASP JUctiengesj^scbafl: , 990121 0050/50066 DE 



# 



5 



Claims 

1. Polypeptide having the amino acid sequence SEQID NO:2 

2. Polynucleotide sequence coding for a polypeptide according 
to Claim 1 




3. Polynucleotide sequence having the sequence SEQID NO:l 

10 

4. Use of a polypeptide according to Claim 1 for identifying 
agents which can inhibit the enzymatic activity of this 
polypeptide. 

15 5. A method for identifying compounds which inhibit the 

enzymatic activity of a polypeptide according to Claim 1 
comprising 

(a) comparing the amount of enzymatic activity of CAPNll in 
2^ the presence of the compound with the amount of enzymatic 

activity of CAPNll in the absence of the compound and 

(b) selecting compoumds which change the amount of enzymatic 
activity of CAPNll compared to the enzymatic activity of 

25 CAPNll in the absence of the compound. 

6. Use of compounds identified by the method according to 
Claim 5 for treatment of male fertility disorders. 

30 




35 



40 
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New calpains and use thereof 

The invention relates to a new mammalian calpain, CAPNll, its 
5 synthesis and the use thereof. 

Calpains are a superfamily of related proteins some of which have 
been shown to function as calcium-dependent cysteine proteases. 
In mammals, eight different calpains have been identified. 

10 

Calpains constitute a family of intracellular calcium-dependent 
cysteine proteases. An increasing number of mammalian calpain 
homologs have been identified and individual members can be 
classified into four classes based on physical structure and 

15 predicted properties. The class A 'classical calpains«, CAPNl, 
CAPN2, CAPN3 (p94), CAPN8 (nCL-2) and CAPN9 (nCL-4) are probably 
all protease-active and Ca2+-dependent . They consist of a 

I variable large (SOkDa) and an invariant small (30kDa) s\abunit. 
The class B and D calpains CAPN5 (6,15) and CAPN7 (8) are 

20 protease-active but most likely Ca2+- independent, the class C 
calpain CAPN6 probably possesses no protease activity. Calpains 
can also be categorized on the basis of their expression patterns 
with CAPN3, CAPN6, CAPN8 and CAPN9 exhibiting some tissue- 
specificity. The function of the calpains is unknown although 

25 they have been associated with a wide variety of physiological 
processes and pathological conditions (reviewed in Ref. 17), 
Identification of the entire spectnam of calpain family members 
is essential to elucidate their function and evolutionary 
history. 

30 

The invention relates to the new polypeptide CAPNll having the 
amino acid sequence disclosed in SEQIDNO:2. 

^ The new calpain CAPNll protein possesses the features typical 
35 of calpains including potential protease and calciiom-binding 
domains. It exhibits a highly restricted tissue distribution 
with the major site of expression being testis. Radiation hybrid 
mapping localized the gene to human chromosome 6, within a region 
mapped to pl2. Phylogenetic analysis suggests that, in mammals, 
40 CAPNll is most closely related to CAPNl and CAPN2 . 

However, of the calpain sequences available, the predicted CAPNll 
sequence exhibits greatest homology to the chicken [i/m calpain. 
Thus CAPNll may be the human ortholog of fx/m calpain. The disco- 
45 very of this new calpain emphasizes the complexity of the calpain 
family with members being distinguished on the basis of protease 
activity, calcium dependence and tissue expression. 
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The cDNA nucleotide sequence of the* CAPNll gene contains 233 8 
nucleotides (SEQ ID MO:l). The cDNA sequence was confirined to 
be derived from a single mRNA by the successful amplification of 
the entire predicted coding region from human testis cDNA using 
5 flanking primers. Multiple cDNA clones were sequenced in their 
entirety to exclude any PCR-based artifacts. 

One large open reading frame exists which encodes a protein of 
702 amino acids (Mr of 80 kDa) (Fig. 1). The amino acid sequence 
10 shows similarity to the large subunit of members of the calpain 
family (Fig. 1). The protein can be subdivided into the four 
domains typical of calpain. Domain II shows the characteristics 
of a protease domain and the predicted amino acid sequence 
possesses the three amino acid residues (Cysl02, His259 and 
15 Asn2 83) which are part of the active site of cysteine proteases 
(2). All five Ca2+ -binding sequences reported for CAPN2 (4,12) 
are conserved to some extent in the amino acid sequence (Fig. 1). 
Thus, this protein is likely to have protease and calciiam-binding 
ability. A comparison of the predicted amino acid sequence with 
20 that of all other calpains revealed greatest sequence homology 
(57.5%) to chicken jx/m calpain. Of the mammalian calpains, the 
most similar was human CAPNl (54.3% homology). The least similar 
human calpain, with only 18.7% homology, was CAPM6. The gene 
corresponding to this cDNA has been designated CAPNll by the 
25 Human Gene Nomenclature Committee. 

The complete amino acid sequences of all the identified h\iman 
calpains were subjected to phylogenetic analysis. The results 
enable the hiornan calpains to be classified into four major 
30 evolutionary groups (Fig. 2). The first group is represented by 
CAPN5, CAPN6, CAPN7 and CAPN8, the second by CAPNl and CAPN2, the 
third group by CAPN3 and CAPN9 and the fourth contains CAPNll. 
Thus, the phylogenetic analysis suggests that CAPNll represents 
a distinct calpain subfamily. 

35 

Expression of CAPNll in human tissues was evaluated by Northern 
and RNA dot blot analysis. Of the 50 tissue RNAs tested, the 
CAPNll mRNA was expressed at highest levels in testis (Fig. 3A) . 
The specificity of this signal was verified by northern blot 

40 analysis and corresponded to an approximately 3Kb mRNA (Fig. 3D), 
Much weaker signals were detected in thymus and mammary gland but 
the significance of these is unclear as further investigation of 
thymus RNA using northern blot analysis did not reveal any signal 
despite long exposure times (Fig. 3D and data not shown). One 

45 possible explanation is that this weak signal is the result of 
cross-hybridization to related mRNAs. Nevertheless, testis is the 
major expression site of CAPNll although we cannot rule out the 
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possibility that the gene is expressed in other tissues which 
were not tested. 

We determined the chromosome location of the human CAPNll 
5 gene. Using PGR with primers specific for the human CAPNll 
nucleotide sequence the gene was mapped to chromosome six with 
a human/rodent somatic cell hybrid mapping panel (Correll Cell 
Repositories). With radiation hybrid mapping using the Stanford 
G3 medium resolution panel (Research Genetics Inc.) and the data- 

10 base at the Stanford Human Genome Center (shgc-www.stanford.edu) 
the gene was localized five cen ti rays from the marker SHGC-32834 
on this chromosome (LOD Score 12.87). This marker is located in 
the interval between the microsatellite markers D6S1616 (59.6 cM) 
and D6S427 (73.9 cM) (7) and a marker in this interval, D6S269, 

15 has been mapped cytogenetically to 5pl2 (5). Thus CAPNll is 

located on chromosome 6 in the vicinity of pl2. No other calpain 
gene has been located on this chromosome. 

The chicken \Ji/m calpain was the first member of the calpain family 
20 to be cloned (16). It was originally designated as m-calpain but 
reclassified after the identification of other chicken calpains 
most likely orthologous to the mammalian \i- and m-calpains (18), 
A mammalian \i/m calpain has remained elusive but as CAPNll 
shows greater homology to chicken |x/m calpain than to any of 
25 the mammalian calpains it may be its ortholog. 

There are now five calpains which exhibit some degree of tissue- 
specificity - CAPN3 (skeletal muscle), CAPN6 (placenta), CAPN8 
(possibly smooth muscle), CAPN9 (stomach and small intestine) 

30 and CAPNll (testis). Numerous proteases have been identified in 
testis and have been proposed to participate in processes such as 
tissue reorganization (20), regulation of spermatogenesis (14), 
penetration of the zona pellucida by sperm (10) and fertility 
(13). Many of these activities are, however, dependent on 

35 secreted proteases and CAPNll is likely to be localized intra- 
cellular ly. in testis it could participate in processes with 
which calpains have been associated in other tissues such as germ 
cell apoptosis (3) or regulation of testis-specif ic transcription 
factors. 

40 

Another aspect of this invention is the use of the polypeptide 
CAPNll for identifying agents which can inhibit the enzymatic 
activity of this polypeptide, so called calpain-inhibitors, 
especially those calpain inhibitors which are selecetive for 
45 CAPNll. Selectivity means that those calpain inhibitors inhibit 
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the activity of CAPNll stronger than the activity of other 
calpains mentioned above, preferably at least 10 times, more 
preferred 25 times stronger. 

The enzymatic activity of CAPMll is a Ca-dependent protease 
5 activity. 

Another aspect of the invention is a method for identifying 
compounds which inhibit enzymatic activity of a polypeptide 
according to claim 1 comprising 

10 

(a) comparing the amount of enzymatic activity of CAPNll in 
the presence of the compound with the amoxint of enzymatic 
activity of CAPNll in the absence of the compound and 

15 (b) selecting compounds which change the amount of enzymatic 
activity of CAPNll compared to the enzymatic activity of 
CAPNll in the absence of the compound. 

The inhibiting compounds identified by the method mentioned above 
20 are useful for the treatment of diseases which are associated or 
linked with an unphysiologically elevated activity of CAPNll such 
as male infertility. 

The dosage and the treatment regimen of those inhibitors have to 
25 be determined by routine procedures which are well known from 
other protease inhibitors. 
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15 Figures 

FIG. 1. Alignment of the predicted amino acid sequence of CAPNll 
to other human calpains. Multiple alignment of amino acid sequen- 
ces was carried out using CLUSTAL W (19). The proposed initiation 

20 methionine for CAPNll (GGAatgG) conforms to the minimal consensus 
sequence for the translational start site (RNNatgG, where R is 
a purine; Ref . 11) . Amino acids identical in the other proteins 
with those of CAPNll shaded. Dashes indicate gaps introduced to 
maximize alignment. Arrow heads indicate the three conserved 

25 amino acids which are part of the active site of calpains. The 
potential EF-hand calcium-binding domains of CAPN8 are underlined 
and numbered sequentially. The arbitrary domains of calpain are 
indicated. Not shown are the sequences for CAPN4 and CAPN7 which 
have only been identified in rat and mouse respectively. The 

30 published CAPN6 sequence was not included as it is only partial. 
The alternative names and accession numbers for the calpains 
aligrned are given in the legend to Fig. 2. 

FIG. 2. Unrooted phylogenetic tree of the human calpain large 
35 subunit family. Analysis was performed using the PAUP program 
and the tree assembled with CLUSTREE from the HUSAR server at 
the German Cancer Research Center, Heidelberg (www.dkf z-heidel- 
berg.de) . Lengths of horizontal lines are proportional to the 
inferred phylogenetic distances; vertical lines have no signifi- 
40 cemce- 1000 bootstrap repetitions were performed and values are 
shown at the inner nodes. The Capn7 sequence is that of the mouse 
as little human nucleotide and protein sequence is available. 
Nevertheless the human ortholog does exist (see Ref. 8) justi- 
fying the use of the mouse sequence for this comparison. The 
45 partial human CAPN8 sequence is the predicted translation from 
the EST clone AA026030 (Hillier et al . , 1995, The WashU-Merck 
EST Project, unpublished results) . An amino acid translation of 
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this clone exhibits high similarity to the rat CapnS sequence. 
No bootstrapping was performed with this sequence as it is much 
shorter than the others and thus the bootstrap value can not be 
meaningfully compared to the values derived from full-length 
5 sequence comparisons. The nomenclature specified by the Human 
Gene Nomenclature Committee is used- Previous names for the 
various calpains are: CAPNl - m-calpain; CAPN2 - m-calpain; CAPN3 
- p94, nCL-1; CAPM8, nCL-2 ; CAPN9, nCL,-4 . The EMBL accession 
numbers for the calpain sequences used are CAPNl (P17655) , CAPN2 
10 (P07384), CAPN3 {P20807), CAPNS (Y10656) , CAPN6 {Y12582), Capn7 
(AJ012475) and CAPN9 (AF022799)- 

FIG. 3. Expression of CAPNll. A 32P-labeled DNA probe containing 
an 800 base-pair segment of the human CAPNll cDNA coding sequence 

15 was hybridized to a Master Blot (A), a nylon filter containing 
dot blots of RNAs from 50 different human tissues, or a Clontech 
Multiple Tissue Northern Blot (D) . Filters were washed at high 
stringency (6xSSC, 65 jC) . The precise location of the various 
RNAs on the dot blot filter is shown schematically (C) . RNAs on 

20 the northern blot are indicated above the relevant lanes. The dot 
blot and northern blot were rehybridized with hximan ubiquitin 
(B) and b-actin DNA probes respectively to confirrm amounts of • 
poly(AH-) RNA loaded. For the northern blot, the position of the 
size markers {in kilobases) are indicated. The exposure times 

25 were: A, 72 hours; B, 24 hours; D, 48 hours. PBL. = peripheral 
blood leukocytes. 
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SEQUEasrCE LISTING 




<110> BASF Aktiengesellschaf t 

<120> Testis specific calpain 

<130> OZ0050/ 

<140> 
<141> 

<i60> 2 

<170> Patentin Ver. 2.0 

<210> 1 

<211> 2338 

<212> DNA 

<213> Homo sapiens 



220> 
21> 5'UTR 
22> (1) . . (103) 




<220> 
<221> CDS 

<222> (104) . . (2212) 
<220> 

<221> 3'UTR 

<222> (2213) . . (2338) 

<400> 1 

ctccccaggg ccgagtcttc cggagtcagc agagagcctg gatggatcac aggaggataa 60 

gcctcggggc tcatgtgcgg agcccacttt tactgatacg gga atg gtg get cac 115 

Met val Ala His 
1 

ata aac aac age egg etc aag gee aag ggc gtg ggc cag cac gac aac 163 
e Asn Asn Ser Arg Leu Lys Ala Lys Gly Val Gly Gin His Asp Asn 
5 10 15 20 

gee cag aac ttt ggt aac cag age ttt gag gag ctg cga gca gee tgt 211 
Ala Gin Asn Phe Gly Asn Gin Ser Phe Glu Glu Leu Arg Ala Ala Cys 
25 30 35 

eta aga aag ggg gag etc ttc gag gac ccc tta ttc act get gaa ccc 259 
..Leu Arg Lys Gly Glu Leu PUe Glu Asp Pro Leu Phe Pro Ala Glu Pro 
40 45 50 

age tea ctg ggc ttc aag gac ctg ggc ccc aac tee aaa aat gtg cag 3 07 
Ser Ser Leu Gly Phe Lys Asp Leu Gly Pro Asn Ser Lys Asn Val Gin 
55 60 65 

aac ate tec tgg cag egg ccc aag gat ate ata aac aac cct eta ttc 355 
Asn lie Ser Trp Gin Arg Pro Lys Asp lie He Asn Asn Pro Leu Phe 
70 75 80 



ate atg gat ggg att tct cea aca gac ate tgc cag ggg ate etc ggg 



403 
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He Met Asp Gly lie Ser Pro Thr Asp He Cys Gin Gly He Leu Gly 

85 90 ' 95 100 

gac tgc tgg ctg ctg get gcc ate ggc tee ctt aec aec tge ecc aaa 451 

Asp Cys Trp Leu Leu Ala Ala lie Gly Ser Leu Thr Thr Cys Pro Lys 

105 110 115 



ctg eta tac cgc gtg gtg cce aga gga cag age ttc aag aaa aac tat 
Leu Leu Tyr Arg Val Val Pro Arg Gly Gin Ser Phe Lys Lys Asn Tyr 
120 125 130 




gtg cac tea ace gaa cgc agt gag ttc tgg agt gcc ctg ctg gag aag 
Val His Ser Thr Glu Arg Ser Glu Phe Trp Ser Ala Leu Leu Glu Lys 
^65 170 175 180 




agt gaa ctg gaa tec atg act gac aag atg ctg gtg aga ggg cac get 
"ler Glu Leu Glu Ser Met Thr Asp Lys Met Leu Val Arg Gly His Ala 
45 250 255 260 



499 



get ggc ate ttc cat ttt cag att tgg cag ttt gga cag tgg gtg aac 547 
Ala Gly He Phe His Phe Gin He Trp Gin Phe Gly Gin Trp Val Asn 
135 140 145 

gtg gtg gta gat gac egg ctg cce aca aag aat gac aag ctg gtg ttt 595 
Val Val Val Asp Asp Arg Leu Pro Thr Lys Asn Asp Lys Leu Val Phe 
150 155 160 



643 



gcg tat gcc aag ctg agt ggg tec tat gaa gca ttg tea ggg ggc agt 691 

Ala Tyr Ala Lys Leu Ser Gly Ser Tyr Glu Ala Leu Ser Gly Gly Ser 

185 190 195 

ace atg gag ggc ctt gag gac ttc aca gga ggc gtg gcc cag age ttc 73 9 

Thr Met Glu Gly Leu Glu Asp Phe Thr Gly Gly Val Ala Gin Ser Phe 
200 205 210 

caa etc cag agg cce ect cag aac ctg etc agg etc ctt agg aag gcc 787 

Gin Leu Gin Arg Pro Pro Gin Asn Leu Leu Arg Leu Leu Arg Lys Ala 
215 220 225 

gtg gag ega tec tec etc atg ggt tgc tec att gaa gtc ace agt gat 835 

Val Glu Arg Ser Ser Leu Met Gly Cys Ser He Glu Val Thr Ser Asp 
230 235 240 



883 



tac tct gtg act ggc ctt cag gat gtc cac tac aga ggc aaa atg gaa 931 
Tyr Ser Val Thr Gly Leu Gin Asp Val His Tyr Arg Gly Lys Met Glu 
265 270 275 

aca ctg att egg gtc egg aat cce tgg ggc egg att gag tgg aat gga 
thr Leu He Arg Val Arg Asn Pro Trp Gly Arg He Glu Trp Asn Gly 
280 285 290 



979 



get tgg agt gac agt gee agg gag tgg gaa gag gtg gcc tea gac ate 1027 
Ala Trp Ser Asp Ser Ala Arg Glu Trp Glu Glu Val Ala Ser Asp He 
295 300 305 

cag atg cag ctg ctg cac aag aeg gag gac ggg gag ttc tgg atg tec 1075 
Gin Met Gin Leu Leu His Lys Thr Glu Asp Gly Glu Phe Trp Met Ser 
310 315 320 



tac caa gat ttc ctg aac aac ttc aeg etc ctg gag ate tgc aac etc 



1123 
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Tyr Gin Asp Phe Leu Asn Asn Phe Thr Leu Leu Glu He Cys Asn Leu 
325 330 ' 335 340 

acg cct gat aca etc tct ggg gac tac aag age tac tgg cac acc acc 1171 
Tlir Pro Asp Thr Leu Ser Gly Asp Tyr Lys Ser Tyr Trp His Thr Thr 
345 350 355 

ttc tac gag ggc age tgg cgc aga ggc age tec gea ggg ggc tgc agg 1219 
Phe Tyr Glu Gly Ser Trp Arg Arg Gly Ser Ser Ala Gly Gly Cys Arg 
360 365 370 

aac cac cct ggc acg ttc tgg acc aac ccc cag ttt aag ate tct ctt 1267 
Asn His Pro Gly Thr Phe Tarp Thr Asn Pro Gin Phe Lys He Ser Leu 
375 380 385 

cct gag ggg gat gac cca gag gat gac gca gag ggc aat gtt gtg gtc 1315 
Pro Glu Gly Asp Asp Pro Glu Asp Asp Ala Glu Gly Asn Val Val Val 
390 395 400 



tgc acc tgc ctg gtg gcc eta atg cag aag aac tgg egg cat gca egg 13 63 
;ys Thr Cys Leu Val Ala Leu Met Gin Lys Asn Trp Arg His Ala Arg 
5 410 415 420 




cag cag gga gcc cag ctg cag acc att ggc ttt gtc etc tac gcg gtc 1411 
Gin Gin Gly Ala Gin Leu Gin Thr He Gly Phe Val Leu Tyr Ala Val 
425 430 435 

cca aaa gag ttt cag aac att cag gat gtc cac ttg aag aag gaa ttc 1459 
Pro Lys Glu Phe Gin Asn He Gin Asp Val His Leu Lys Lys Glu Phe 
440 445 450 

ttc acg aag tat cag gac cac ggc ttc tea gag ate ttc acc aac tea 1507 
Phe Thr Lys Tyr Gin Asp His Gly Phe Ser Glu He Phe Thr Asn Ser 
455 460 465 

egg gag gtg age age caa etc egg ctg cct ccg ggg gaa tat ate att 1555 
Arg Glu Val Ser Ser Gin Leu Arg Leu Pro Pro Gly Glu Tyr He He 
470 475 480 

att ccc tec acc ttt gag cca cac aga gat get gac ttc ctg ctt egg 1603 
le Pro Ser Thr Phe Glu Pro His Arg Asp Ala Asp Phe Leu Leu Arg 
5 490 495 500 

gtc ttc acc gag aag cac age gag tea tgg gaa ttg gat gaa gtc aac 1651 
Val Phe Thr Glu Lys His Ser Glu Ser Trp Glu Leu Asp Glu Val Asn 
505 510 515 

tat get gag caa etc caa gag gaa aag gtc tct gag gat gac atg gac 1699 
.Tyr Ala Glu Gin Leu Gin Glu Glu Lys Val Ser Glu Asp Asp Met Asp 
520 525 530 

cag gac ttc eta eat ttg ttt aag ata gtg gca gga gag ggc aag gag 1747 
Gin Asp Phe Leu His Leu Phe Lys He Val Ala Gly Glu Gly Lys Glu 
535 540 545 

ata ggg gtg tat gag etc cag agg ctg etc aac agg atg gcc ate aaa 1795 
He Gly Val Tyr Glu Leu Gin Arg Leu Leu Asn Arg Met Ala He Lys 
550 555 560 



ttc aaa age ttc aag ace aag ggc ttt ggc ctg gat get tgc cgc tgc 



1843 
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Phe Lys Ser Phe Lys Thr Lys Gly Phe Gly Le*a Asp Ala Cys~Arg uys 
565 570 575 580 

atg ate aac etc atg gat aaa gat ggc tct ggc aag ctg ggg ctt eta 
Met Xle Asn Ueu Met Asp Lys Asp Gly Ser Gly Lys Leu Gly Leu Leu 
585 590 595 



1891 



gag ttc aag ate ctg tgg aaa aaa etc aag aaa tgg atg gae ate ttc 1939 
Glu Phe Lys lie Leu Trp Lys Lys Leu Lys Lys Trp Met Asp lie Ptie 
600 605 610 




aga gag tgt gac cag gac cat tea ggc ace ttg aac tec tat gag atg 
Arg Glu Cys Asp Gin Asp His Ser Gly Thr Leu Asn Ser Tyr Glu Met 
615 620 625 

cgc ctg gtt att gag aaa gca ggc ate aag ctg aac aac aag gta atg 
Arg Leu Val lie Glu Lys Ala Gly He Lys Leu Asn Asn Lys val Met 
630 635 640 

cag gtc ctg g-tg gee agg tat gca gat gat gac ctg ate ata gac ttt 
In Val Leu Val Ala Arg Tyr Ala Asp Asp Asp Leu He He Asp Phe 
45 650 655 660 

gac age ttc ate age tgt ttc ctg agg eta aag ace atg ttc aca ttc 
Asp Ser Phe lie Ser Cys Phe Leu Arg Leu Lys Thr Met Phe Thr Phe 
665 670 675 

ttt eta ace atg gac ccc aag aat act ggc cat att tge ttg age ctg 
Phe Leu Thr Met Asp Pro Lys Asn Thr Gly His lie Cys Leu Ser Leu 
680 685 690 



1987 



2035 



2083 



2131 



2179 




gaa cag tgg etg cag atg acc atg tgg gga tag aggcgctgta ggagcctggt 2232 
Glu Gin Trp Leu Gin Met Thr Met Trp Gly 
695 700 

catctctacc agcagcagea gcagcgaggt tctagcccag gagggtgggg tgcttcttgt 2292 
agccctcagc tctccagtet ctgctgatga aatgggatcc aggtgg 2338 

210> 2 
211> 702 
212> PRT 
<213> Homo sapiens 

<400> 2 

Met Val Ala His He Asn Asn Ser Arg Leu Lys Ala Lys Gly Val Gly 
15 10 15 

Gin His Asp Asn Ala Gin Asn Phe Gly Asn Gin Ser Phe Glu Glu Leu 
20 25 30 

Arg Ala Ala Cys Leu Arg Lys Gly Glu Leu Phe Glu Asp Pro Leu Phe 
35 40 45 

Pro Ala Glu Pro Ser Ser Leu Gly Phe Lys Asp Leu Gly Pro Asn Ser 
50 55 60 



Lys Asn Val Gin Asn He Ser Trp Gin Arg Pro Lys Asp He Xle Asn 
65 70 75 80 
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Asn Pro Leu Phe lie Met Asp Gly He Ser Pro Thr Asp He Cys Gin 

85 90 95 

Gly lie Leu Gly Asp Cys Trp Leu Leu Ala Ala He Gly Ser Leu Thr 
100 105 110 

Thr Cys Pro Lys Leu Leu Tyr Arg Val Val Pro Arg Gly Gin Ser Phe 
115 120 125 

Lys Lys Asn Tyr Ala Gly lie Phe His Phe Gin He Trp Gin Phe Gly 
130 135 140 

Gin Trp Val Asn Val Val Val Asp Asp Arg Leu Pro Thr Lys Asn Asp 
145 150 155 160 

Lys Leu Val Phe Val His Ser Thr Glu Arg Ser Glu Phe Trp Ser Ala 
165 170 175 



Leu Leu Glu Lys Ala Tyr Ala Lys Leu Ser Gly Ser Tyr Glu Ala Leu 
180 185 190 




er Gly Gly Ser Thr Met Glu Gly Leu Glu Asp Phe Thr Gly Gly Val 
195 200 205 

Ala Gin Ser Phe Gin Leu Gin Arg Pro Pro Gin Asn Leu Leu Arg Leu 
210 215 220 

Leu Arg Lys Ala Val Glu Arg Ser Ser Leu Met Gly Cys Ser He Glu 
225 230 235 240 

Val Thr Ser Asp Ser Glu Leu Glu Ser Met Thr Asp Lys Met Leu Val 
245 250 255 

Arg Gly His Ala Tyr Ser Val Thr Gly Leu Gin Asp Val His Tyr Arg 
260 265 270 

Gly Lys Met Glu Thr Leu He Arg Val Arg Asn Pro Trp Gly Arg He 
275 280 285 

lu Trp Asn Gly Ala Trp Ser Asp Ser Ala Arg Glu Trp Glu Glu Val 
290 295 300 

Ala Ser Asp He Gin Met Gin Leu Leu His Lys Thr Glu Asp Gly Glu 
305 310 315 320 

Phe Trp Met Ser Tyr Gin Asp Phe Leu Asn Asn Phe Thr Leu Leu Glu 
325 330 335 

He Cys Asn Leu Thr Pro Asp Thr Leu Ser Gly Asp Tyr Lys Ser Tyr 
340 345 350 

Trp His Thr Thr Phe Tyr Glu Gly Ser Trp Arg Arg Gly Ser Ser Ala 
355 360 365 

Gly Gly Cys Arg Asn His Pro Gly Thr Phe Trp Thr Asn Pro Gin Phe 
370 375 380 

Lys He Ser Leu Pro Glu Gly Asp Asp Pro Glu Asp Asp Ala Glu Gly 
385 390 395 400 
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Asn Val val Val Cys Thr Cys Leu Val Ala Leu Met Gin Lys Asn Trp 
405 410 415 

Arg His Ala Arg Gin Gin Gly Ala Gin Leu Gin Thr He Gly Phe Val 
420 425 430 

Leu Tyr Ala Val Pro Lys Glu Phe Gin Asn He Gin Asp Val His Leu 
435 440 445 

Lys Lys Glu Phe Phe Thr Lys Tyr Gin Asp His Gly Phe Ser Glu He 
450 455 460 

Phe Thr Asn Ser Arg Glu Val Ser Ser Gin Leu Arg Leu Pro Pro Gly 
465 470 475 480 

Glu Tyr He He He Pro Ser Thr Phe Glu Pro His Arg Asp Ala Asp 
485 490 495 



Phe Leu Leu Arg Val Phe Thr Glu Lys His Ser Glu Ser Trp Glu Leu 
500 505 510 




sp Glu Val Asn Tyr Ala Glu Gin Leu Gin Glu Glu Lys Val Ser Glu 
515 520 525 

Asp Asp Met Asp Gin Asp Phe Leu His Leu Phe Lys He Val Ala Gly 
530 535 540 

Glu Gly Lys Glu He Gly Val Tyr Glu Leu Gin Arg Leu Leu Asn Arg 
545 550 555 560 

Met Ala He Lys Phe Lys Ser Phe Lys Thr Lys Gly Phe Gly Leu Asp 
565 570 575 

Ala Cys Arg Cys Met He Asn Leu Met Asp Lys Asp Gly Ser Gly Lys 
580 585 590 

Leu Gly Leu Leu Glu Phe Lys He Leu Trp Lys Lys Leu Lys Lys Trp 
595 600 605 

ipt Asp He Phe Arg Glu Cys Asp Gin Asp His Ser Gly Thr Leu Asn 
610 615 620 

Ser Tyr Glu Met Arg Leu Val He Glu Lys Ala Gly He Lys Leu Asn 
625 630 635 640 

Asn Lys Val Met Gin Val Leu Val Ala Arg Tyr Ala Asp Asp Asp Leu 
645 650 655 

He He Asp Phe Asp Ser Phe He Ser Cys Phe Leu Arg Leu Lys Thr 
660 665 670 

Met Phe Thr Phe Phe Leu Thr Met Asp Pro Lys Asn Thr Gly His He 
675 680 685 

Cys Leu Ser Leu Glu Gin Trp Leu Gin Met Thr Met Trp Gly 
690 695 700 
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